Online Behavior Evaluation with the Switching Wizard of Oz
نویسندگان
چکیده
Advances in animation and sensor technology allow us to engage in face-to-face conversations with virtual agents [1]. One major challenge is to generate the virtual agent’s appropriate, human-like behavior contingent with that of the human conversational partner. Models of (nonverbal) behavior are pre-dominantly learned from corpora of dialogs between human subjects [2], or based on simple observations from literature (e.g. [3–6]). Humans are particularly sensitive to flaws in the displayed behavior, both in form and timing [7, 8]. This effect also occurs when certain behaviors are not animated, which is common in experimental settings where the behavior of the virtual agent is varied systematically only one or a few modalities [9, 10]. This leads to biased perceptual ratings, which hampers progress in the design and implementation of behavior synthesis algorithms. To this end, we propose a methodology and implementation that combines ideas behind the human Turing test with those of a Wizard of Oz setup. At the heart of our approach is a distributed (video-conferencing) setting with two human conversational partners. Each of the subjects is observed with camera and microphone and algorithms are employed to analyze the verbal and nonverbal behavior in real-time (similar to e.g., [11–13]). These observations are used as input to a behavior synthesis model. Both subjects are shown a virtual representation of the other (see Fig. 2), animated based on one of two sources: (1) directly on the observed behavior of the other, or (2) on the output of the synthesis model. Both sources share the same behavior animation capabilities and limitations. We can therefore analyze the effect of the quantity, type and timing of the nonverbal behaviors on the perception thereof. During a conversation, the source of animation of the representation of each subject switches occasionally. The idea behind the framework is that, when the displayed behavior deviates from what is typically regarded as human-like, the observer should notice. In this case, he or she is instructed to press a button (the yuck button [10]). The ratings can be used to evaluate and improve the behavior synthesis models (e.g. [14]). As observations of the subjects are continuously recorded, the framework doubles as a tool for study into nonverbal behavior.
منابع مشابه
Online Backchannel Synthesis Evaluation with the Switching Wizard of Oz
In this paper, we evaluate a backchannel synthesis algorithm in an online conversation between a human speaker and a virtual listener. We adopt the Switching Wizard of Oz (SWOZ) approach to assess behavior synthesis algorithms online. A human speaker watches a virtual listener that is either controlled by a human listener or by an algorithm. The source switches at random intervals. Speakers ind...
متن کاملWizard of Oz Experiments on Speech Dialogue Systems Design and Realisation with a New Integrated Simulation Environment
The Wizard of Oz simulation technique is an approved aid for designing speech dialogue systems. There are a number of tools for simulation which have been used successfully, but which are inflexible in terms of the application domain, support for additional modalities, or integration with existing dialogue design tools. This thesis describes the design and realisation of Wizard of Oz experiment...
متن کاملWizard of Oz Studies with Children
This paper provides an overview of the literature pertaining to Wizard of Ox studies with children participants. It presents a new taxonomy for Wizard of Oz evaluations and, whilst focusing on three case studies that have been carried out by the authors, provides a presents a discussion of several ethical and organizational concerns with Wizard of Oz as a method for use with child participants....
متن کاملPuppet Prototyping: Wizard of Oz Support throughout an Iterative Design Process
Although the Wizard of Oz (WOz) method for simulating system components is commonly used for evaluation in HCI, researchers and designers have only started to unlock the potential of this technique. In this paper, we review the Wizard of Oz method and highlight its usefulness throughout the evolution of a user interface or system. We point toward a design space for WOz simulation, where the Wiz...
متن کاملDifferences in User Responses to a Wizard-of-Oz versus Automated System
Wizard-of-Oz experimental setup in a dialogue system is commonly used to gather data for informing an automated version of that system. Previous work has exposed dependencies between user behavior towards systems and user belief about whether the system is automated or human-controlled. This work examines whether user behavior changes when user belief is held constant and the system’s operator ...
متن کامل